For enhancing noisy signals, machine-learning based single-channel speech enhancement schemes exploit prior knowledge about typical speech spectral structures. To ensure good generalization and to meet requirements in terms of computational complexity and memory consumption, certain methods restrict themselves to learning speech spectral envelopes. We refer to these approaches as machine-learning spectral envelope (MLSE)-based approaches. In this paper we show by means of theoretical and experimental analyses that for MLSE-based approaches, super-Gaussian priors allow for a reduction of noise between speech spectral harmonics which is not achievable using Gaussian estimators such as the Wiener filter. For the evaluation, we use a deep neural network (DNN)-based phoneme classifier and a low-rank nonnegative matrix factorization (NMF) framework as examples of MLSE-based approaches. A listening experiment and instrumental measures confirm that while super-Gaussian priors yield only moderate improvements for classic enhancement schemes, for MLSE-based approaches super-Gaussian priors clearly make an important difference and significantly outperform Gaussian priors.
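As background for the Gaussian baseline referenced above: under a Gaussian speech prior, the MMSE spectral estimator reduces to the Wiener filter, whose gain depends only on the a priori SNR. The following is a minimal illustrative sketch (function and variable names are our own, not from the paper):

```python
import numpy as np

def wiener_gain(xi):
    """Wiener filter gain for a priori SNR `xi` (linear scale).

    With a Gaussian speech prior, the MMSE estimate of a clean spectral
    coefficient is G * (noisy coefficient), where G = xi / (1 + xi).
    """
    xi = np.asarray(xi, dtype=float)
    return xi / (1.0 + xi)

# The gain approaches 1 at high a priori SNR (speech is kept)
# and approaches 0 at low a priori SNR (noise is suppressed).
snr_db = np.array([-10.0, 0.0, 10.0])
xi = 10.0 ** (snr_db / 10.0)  # convert dB to linear SNR
print(wiener_gain(xi))
```

Super-Gaussian priors replace this gain function with a more aggressive suppression rule at low local SNRs, which, per the paper's findings, is what enables the additional noise reduction between spectral harmonics in MLSE-based schemes.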